Constraints for corpora development and validation
نویسندگان
چکیده
In this paper we consider corpora as a set of XML documents. The guidelines for the creation of the corpora determine the semantics of the data, stored in them. Usually the guidelines prescribe the actual structure of the corpora, the used symbols, their meaning and the relations among them. Ideally, the software supporting the creation of a corpus has to allow all the constraints that follow from the guidelines to be imposed over the XML representation of the corpus. To the best of our knowledge, such software does not exist yet. The main problems come from the complexity of the data in the corpus and the impossibility it to be to completely formalized.
منابع مشابه
Extracting Constraints on Word Usage from Large Text Corpora
Our research focuses on the identification of word usage constraints from large text corpora. Such constraints are important for natural language systems, both for the problem of selecting vocabulary for language generation and for disambiguating lexical meaning in interpretation. The first stage of our research involves the development of systems that can automatically extract such constraints...
متن کاملDevelopment and validation of a moral intelligence questionnaire for Adolescent children of veterans
متن کامل
Optimizing Disparity Candidates Space in Dense Stereo Matching
In this paper, a new approach for optimizing disparity candidates space is proposed for the solution of dense stereo matching problem. The main objectives of this approachare the reduction of average number of disparity candidates per pixel with low computational cost and high assurance of retaining the correct answer. These can be realized due to the effective use of multiple radial windows, i...
متن کاملConstraints to Farmers Willingness to Pay for Private Irrigation Delivery in Nandom, Ghana
The study investigated the constraints to farmers’ intention to pay for private irrigation in Nandom District, Ghana. Using a key informant interviews and semi-structured questionnaires, the study collected data from 236 farmers. Data was analyzed with descriptive and inferential statistics. Kendall coefficient of concordance was used to determine the level of agreement among farmers in ranking...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003